Phoneme Modeling for Speech Recognition in Kannada using Multivariate Bayesian Classifier
نویسندگان
چکیده
We build an automatic phoneme recognition system based on Bayesian Multivariate Modeling which is a static scheme. Phoneme models were built by using stochastic pattern recognition and acoustic phonetic schemes to recognise phonemes. Since our native language is Kannada, a rich South Indian Language, we have used 15 Kannada phonemes to train and test these models. As Mel – Frequency Cepstral Coefficients (MFCC) are well known acoustic features of speech, we have used the same in speech feature extraction. Finally performance analysis of models in terms of Phoneme Error Rate (PER) justifies the fact that though static modeling yields good results, improvization is necessary in order to use it in developing Automatic Speech Recognition systems. KeywordsBayesian Classification, Kannada, MFCC, Pattern Recognition; PER, Phoneme Modeling
منابع مشابه
Allophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملImproved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملPhoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملThe Meta-pi Network: Connectionist Rapid Adaptation for High-performance Multi-speaker Phoneme Recognition
We present a multi-network Time-Delay Neural Network (TDNN)based connectionist architecture that allows us to perform multispeaker phoneme discrimination (/b,d,gh at the speaker-dependent recognition rate of 98.4%. The overall network gates the phonemic decisions of modules trained on individual speakers to form its over-all classification decision. By dynamically adapting to the input speech a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014